CDS

Accession Number TCMCG019C21225
gbkey CDS
Protein Id XP_022951947.1
Location complement(join(3515982..3516311,3516914..3517077,3517171..3517279,3517351..3517500,3517572..3517649,3517723..3517923,3518002..3518214,3518980..3519207))
Gene LOC111454680
GeneID 111454680
Organism Cucurbita moschata

Protein

Length 490aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA418582
db_source XM_023096179.1
Definition endoglucanase 16-like [Cucurbita moschata]

EGGNOG-MAPPER Annotation

COG_category G
Description Belongs to the glycosyl hydrolase 9 (cellulase E) family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R06200        [VIEW IN KEGG]
R11307        [VIEW IN KEGG]
R11308        [VIEW IN KEGG]
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K01179        [VIEW IN KEGG]
EC 3.2.1.4        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00500        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00500        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGGGACCATCATCCACAATCTTTCTGCCATGGCCATTCTTGTGGGTTCCCTTCTTTTCTTTCAACCATTCTTTCCCGGGGCTGTTCATGCCGCCGACTTCAATTACAAAGATGCCCTCACCAAATCTCTTATTTTTCTCGAGGCTCAACGCTCCGGCAAGCTCCCGCCCAATCATCGCCCCGCTTGGAGAGGCGACTCCGCCCTCGACGATGGAAAACTTGCCAATGTAGACCTAGTAGGAGGGTATTACGACGCCGGAGACAACGTGAAGTACGGACTTCCGATGGCTTTCACGGTAACAACTCTATCATGGGGAGCTTTGACTTACCCAGCGGAGCTGGAAGCCGCCGGCGAAATGGAAAATCTTAAAGCCGCCATCAAATGGGGCACCGATTATTTCCTCAAAGCCTCTTCTCATCGCGATCGTTTATATGTCGAGGTCGGAGACCCCGTTAAGGATCACGAGTGTTGGGTTAGACCTGAAAATATGAAGACTCCAAGGACCGTATTGCAAATTGATTCCGAGACCCCCGGTACAGAAATTGCTGCCGAAACCTCCGCCGCCATGGCTTCGTCTTCCATCGTCTTCCGACACTCCAATCAAACATATGCTCGTCTTCTTCTCAACAAAGCTAAAACGCTTTATAAATTCGCAAAAGCCCACAAGGCAACTTACGATGGCGAGTGCCCTTTCTATTGCTCGTACTCGGGCTACAATGACGAGTTGTTGTGGGCTGCAACATGGCTATACGTCGCAACGAGGAAGTCGGTTTATTTGAAGTATGTTCTAGAAGAGTCGATTAGTGCTAGTGTAGCTGAATTCAGCTGGGATCTCAAATATGTTGGAGCTCAAGTTCTTCTTTCCAAGTTATATTTTGAAGGAGAGAAGGGTTTAGAGACGTTCAAAAATCAGGCAGATAGCTATATTTGTTCTAATCTTCCGACCAGCCCTTACCACCAAATTTACGTGTCTCCAGGGGGAATGGTTCACATGAGAGATGGAGCGAATACGCAATATGTTACGGGAACGGCGTTCGTATTTAGTGCTTATAGCGATATCCTTGCAGCCTATAAACAAAACGTTAAATGTAGTGACCAGCAGTTTGACCCGGCCCATCTCATGACTTTTGCTAAGAAACAGATGGATTACTTGCTGGGGGACAACCCACTAGGAAGATCGTTTATGGTAGGGTTTGGGAACAACCCACCAACGCAGGCGCACCACCGCGGCGCGTCGGTGCCAGTGATGCCAGCCAACGCAGAAGTGAACTGCCCAATGAGTTTCGTGAACTGGCTGAACAAGGACACGCCGAACCCCAACGAGCTGACGGGCGCAATTCTGGGCGGCCCGGACCGCAACGACAAGTTCTTAGACAAGCGTACGGTGTCACCCATGACGGAGCCGGTGACTTACACCAACTCCATGGCCGTGGGAGTGCTGGCAAAGCTGGCGGCCCACAAAATCACATGA
Protein:  
MGTIIHNLSAMAILVGSLLFFQPFFPGAVHAADFNYKDALTKSLIFLEAQRSGKLPPNHRPAWRGDSALDDGKLANVDLVGGYYDAGDNVKYGLPMAFTVTTLSWGALTYPAELEAAGEMENLKAAIKWGTDYFLKASSHRDRLYVEVGDPVKDHECWVRPENMKTPRTVLQIDSETPGTEIAAETSAAMASSSIVFRHSNQTYARLLLNKAKTLYKFAKAHKATYDGECPFYCSYSGYNDELLWAATWLYVATRKSVYLKYVLEESISASVAEFSWDLKYVGAQVLLSKLYFEGEKGLETFKNQADSYICSNLPTSPYHQIYVSPGGMVHMRDGANTQYVTGTAFVFSAYSDILAAYKQNVKCSDQQFDPAHLMTFAKKQMDYLLGDNPLGRSFMVGFGNNPPTQAHHRGASVPVMPANAEVNCPMSFVNWLNKDTPNPNELTGAILGGPDRNDKFLDKRTVSPMTEPVTYTNSMAVGVLAKLAAHKIT